short term memory
Creating Scalable AGI: the Open General Intelligence Framework
Dollinger, Daniel A., Singleton, Michael
Recent advancements in Artificial Intelligence (AI), particularly with Large Language Models (LLMs), have led to significant progress in narrow tasks such as image classification, language translation, coding, and writing. However, these models face limitations in reliability and scalability due to their siloed architectures, which are designed to handle only one data modality (data type) at a time. This single modal approach hinders their ability to integrate the complex set of data points required for real-world challenges and problem-solving tasks like medical diagnosis, quality assurance, equipment troubleshooting, and financial decision-making. Addressing these real-world challenges requires a more capable Artificial General Intelligence (AGI) system. Our primary contribution is the development of the Open General Intelligence (OGI) framework, a novel systems architecture that serves as a macro design reference for AGI. The OGI framework adopts a modular approach to the design of intelligent systems, based on the premise that cognition must occur across multiple specialized modules that can seamlessly operate as a single system. OGI integrates these modules using a dynamic processing system and a fabric interconnect, enabling real-time adaptability, multi-modal integration, and scalable processing. The OGI framework consists of three key components: (1) Overall Macro Design Guidance that directs operational design and processing, (2) a Dynamic Processing System that controls routing, primary goals, instructions, and weighting, and (3) Framework Areas, a set of specialized modules that operate cohesively to form a unified cognitive system. By incorporating known principles from human cognition into AI systems, the OGI framework aims to overcome the challenges observed in today's intelligent systems, paving the way for more holistic and context-aware problem-solving capabilities.
- North America > United States > Texas (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > New York > Erie County > Buffalo (0.04)
- Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Health & Medicine > Diagnostic Medicine (0.66)
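The modular design the abstract describes can be sketched in miniature: specialized modules registered with a dynamic routing layer that dispatches inputs by modality and applies per-module weights. All class and method names below are illustrative, not from the OGI framework itself.

```python
# Hypothetical sketch of an OGI-style modular design: specialized modules
# registered with a dynamic router that dispatches inputs by modality and
# returns weighted outputs. Names are illustrative only.

class Module:
    def __init__(self, name, modality):
        self.name = name
        self.modality = modality

    def process(self, payload):
        # A real module would run a model; here we just tag the payload.
        return f"{self.name}({payload})"

class DynamicRouter:
    """Stands in for the Dynamic Processing System: controls routing
    and per-module weighting."""
    def __init__(self):
        self.modules = {}
        self.weights = {}

    def register(self, module, weight=1.0):
        self.modules.setdefault(module.modality, []).append(module)
        self.weights[module.name] = weight

    def route(self, modality, payload):
        return [(self.weights[m.name], m.process(payload))
                for m in self.modules.get(modality, [])]

router = DynamicRouter()
router.register(Module("vision", "image"), weight=0.8)
router.register(Module("language", "text"), weight=1.0)
print(router.route("text", "diagnose symptoms"))
```

The point of the sketch is the separation of concerns: modules know only their modality, while routing, weighting, and goal management live in the dynamic processing layer.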
The Framework of a Design Process Language
The thesis develops a view of design in a concept formation framework and outlines a language to describe both the object of the design and the process of designing. The unknown object at the outset of the design work may be seen as an unknown concept that the designer is to define. Throughout the process, she develops a description of this object by relating it to known concepts. The search stops when the designer is satisfied that the design specification is complete enough for the object, once built, to satisfy the requirements. The specification is then a collection of propositions that all contribute towards defining the design object - a collection of sentences describing relationships between the object and known concepts. The design process itself may likewise be described by relating known concepts - by organizing known abilities into particular patterns of activation, or mobilization. In view of the demands posed on a language to be used in this concept formation process, the framework of a Design Process Language (DPL) is developed. The basis for the language is a set of linguistic categories that act as classes of relations used to combine concepts. These categories describe process and object within the same general system: some relations are process-specific, others are object-specific, and the bulk are used for both process and object description. Another outcome is the distinction of modal relations - relations describing futurity, possibility, willingness, hypothetical events, and the like. The design process almost always involves such aspects, so a language facilitating design process description must support the construction of such relationships. The DPL is argued to be a foundation on which to build a language that can enable computers to be more useful - to act more intelligently - in the design process.
- Europe (1.00)
- North America > United States > Massachusetts > Middlesex County (0.27)
- Summary/Review (1.00)
- Research Report (1.00)
- Collection > Book (0.45)
- Transportation (0.92)
- Health & Medicine (0.67)
- Shipbuilding (0.67)
- Energy > Oil & Gas > Upstream (0.67)
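The core idea of the abstract - a design description as a collection of propositions relating the object to known concepts, with modal relations set apart - can be illustrated with a tiny data structure. The relation names and example propositions below are invented for illustration, not taken from the thesis.

```python
# Illustrative sketch (not from the thesis): design propositions stored as
# (subject, relation, object) triples, with modal relations flagged so that
# possibility, futurity, and hypothetical events can be expressed alongside
# plain description. Relation names are hypothetical.

MODAL_RELATIONS = {"may_be", "will_be", "could_enable"}

class DesignDescription:
    def __init__(self):
        self.propositions = []

    def relate(self, subject, relation, obj):
        self.propositions.append((subject, relation, obj))

    def modal(self):
        # Propositions about possibility, futurity, hypothetical events.
        return [p for p in self.propositions if p[1] in MODAL_RELATIONS]

spec = DesignDescription()
spec.relate("hull", "is_a", "load-bearing structure")   # object description
spec.relate("hull", "may_be", "aluminium")              # modal: possibility
spec.relate("welding", "precedes", "painting")          # process description
print(spec.modal())
```

Keeping modal propositions queryable separately mirrors the thesis's point that a design language must represent what is still possible or hypothetical, not only what is settled.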
Exploring Unseen Environments with Robots using Large Language and Vision Models through a Procedurally Generated 3D Scene Representation
S, Arjun P, Melnik, Andrew, Nandi, Gora Chand
Recent advancements in Generative Artificial Intelligence, particularly in the realm of Large Language Models (LLMs) and Large Vision Language Models (LVLMs), have enabled the prospect of leveraging cognitive planners within robotic systems. This work focuses on solving the object goal navigation problem by mimicking human cognition to attend to, perceive, and store task-specific information, and to generate plans from it. We introduce a comprehensive framework capable of exploring an unfamiliar environment in search of an object by leveraging the capabilities of LLMs and LVLMs in understanding the underlying semantics of our world. A challenging aspect of using LLMs to generate high-level sub-goals is representing the environment around the robot efficiently. We propose a modular 3D scene representation, with semantically rich descriptions of the objects, to provide the LLM with task-relevant information. However, providing the LLM with a mass of contextual information (a rich semantic 3D scene representation) can lead to redundant and inefficient plans. We therefore propose an LLM-based pruner that leverages in-context learning to remove information irrelevant to the specific goal.
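The pruning step can be sketched with a deterministic stand-in for the LLM's relevance judgment; the real system queries an LLM with in-context examples, whereas the scorer below is crude keyword overlap. The scene entries and goal string are invented for illustration.

```python
# Minimal sketch of goal-specific pruning of a scene representation.
# `relevance` is a deterministic stand-in for an LLM relevance call.

scene = [
    {"object": "red mug", "location": "kitchen counter"},
    {"object": "sofa", "location": "living room"},
    {"object": "coffee machine", "location": "kitchen counter"},
]

def relevance(entry, goal):
    # Stand-in for the LLM: crude token overlap with the goal phrase.
    tokens = set(goal.lower().split())
    text = f"{entry['object']} {entry['location']}".lower().split()
    return len(tokens & set(text))

def prune(scene, goal, keep=2):
    # Keep only the top-`keep` entries most relevant to the navigation goal.
    ranked = sorted(scene, key=lambda e: relevance(e, goal), reverse=True)
    return ranked[:keep]

print(prune(scene, "find the red mug in the kitchen"))
```

However relevance is judged, the effect is the same: the planner sees a short, goal-focused context instead of the full scene graph, which is what keeps the generated sub-goals from becoming redundant.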
Identifying Risk Patterns in Brazilian Police Reports Preceding Femicides: A Long Short Term Memory (LSTM) Based Analysis
Lima, Vinicius, de Oliveira, Jaque Almeida
Femicide refers to the killing of a female victim, often perpetrated by an intimate partner or family member, and is also associated with gender-based violence. Studies have shown that there is a pattern of escalating violence leading up to these killings, highlighting the potential for prevention if the level of danger to the victim can be assessed. Machine learning offers a promising approach to address this challenge by predicting risk levels based on textual descriptions of the violence. In this study, we employed the Long Short Term Memory (LSTM) technique to identify patterns of behavior in Brazilian police reports preceding femicides. Our first objective was to classify the content of these reports as indicating either a lower or higher risk of the victim being murdered, achieving an accuracy of 66%. In the second approach, we developed a model to predict the next action a victim might experience within a sequence of patterned events. Both approaches contribute to the understanding and assessment of the risks associated with domestic violence, providing authorities with valuable insights to protect women and prevent situations from escalating.
- South America > Brazil > Federal District > Brasília (0.04)
- Europe (0.04)
- South America > Brazil > Rio Grande do Sul > Porto Alegre (0.04)
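Before an LSTM can classify report texts, they must be turned into fixed-length index sequences. The sketch below shows that preprocessing step only; the example reports and vocabulary are invented stand-ins, not data from the study.

```python
# Sketch of the text preprocessing an LSTM risk classifier typically needs:
# tokenise reports, build a vocabulary, and pad index sequences to a fixed
# length, ready for an embedding layer followed by an LSTM.

def build_vocab(texts):
    vocab = {"<pad>": 0, "<unk>": 1}
    for text in texts:
        for token in text.lower().split():
            vocab.setdefault(token, len(vocab))
    return vocab

def encode(text, vocab, max_len):
    ids = [vocab.get(tok, vocab["<unk>"]) for tok in text.lower().split()]
    ids = ids[:max_len]
    return ids + [vocab["<pad>"]] * (max_len - len(ids))

reports = ["verbal threats reported", "threats escalated to physical assault"]
vocab = build_vocab(reports)
batch = [encode(r, vocab, max_len=6) for r in reports]
print(batch)  # fixed-length index sequences
```

For the study's second task (predicting the next action in a sequence of events), the same encoding applies, with the targets shifted by one step.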
Oscillatory Model of Short Term Memory
We investigate a model in which excitatory neurons have dynamical thresholds which display both fatigue and potentiation. The fatigue property leads to oscillatory behavior. It is responsible for the ability of the model to perform segmentation, i.e., decompose a mixed input into staggered oscillations of the activities of the cell-assemblies (memories) affected by it. Potentiation is responsible for sustaining these staggered oscillations after the input is turned off, i.e., the system serves as a model for short term memory. It has a limited STM capacity, reminiscent of the magical number 7 ± 2.
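The fatigue mechanism can be illustrated with a toy one-unit simulation (not the paper's model): a threshold that rises while the unit is active and recovers while it is silent turns a constant input into sustained on/off oscillations. All parameter values are arbitrary.

```python
# Toy fatigue oscillator: a single excitatory unit fires while the input
# exceeds its threshold; the threshold fatigues (rises) during activity
# and recovers (falls) during silence, so constant input yields oscillation.

def simulate(steps=400, inp=0.5, tau=20.0, dt=1.0):
    theta, trace = 0.0, []
    for _ in range(steps):
        x = 1 if inp > theta else 0      # fire while input exceeds threshold
        target = 1.0 if x else 0.0       # fatigue pulls theta up, recovery down
        theta += dt * (target - theta) / tau
        trace.append(x)
    return trace

trace = simulate()
switches = sum(1 for a, b in zip(trace, trace[1:]) if a != b)
print(switches)  # repeated on/off alternations indicate oscillation
```

In the paper's full model, several such fatiguing cell-assemblies driven by a mixed input fall into staggered phases of this cycle, which is what produces segmentation.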
Analysis of Short Term Memories for Neural Networks
Short term memory is indispensable for the processing of time-varying information with artificial neural networks. In this paper a model for linear memories is presented, and ways to include memories in connectionist topologies are discussed. A comparison is drawn among different memory types, indicating the salient characteristic of each memory model.
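Two classic linear memory structures in this literature can be sketched side by side: the tapped delay line (a pure shift register) and the gamma memory, a leaky cascade g_k[t] = (1 - mu) * g_k[t-1] + mu * g_[k-1][t-1] that trades temporal resolution for depth and reduces to the delay line at mu = 1. The implementation below is a generic illustration, not code from the paper.

```python
# Gamma memory cascade; with mu = 1 each stage is a pure one-step delay,
# so the structure degenerates to a tapped delay line.

def gamma_memory(signal, taps, mu):
    g = [0.0] * (taps + 1)       # g[0] holds the current input sample
    history = []
    for u in signal:
        prev = g[:]              # stage values at t-1
        g[0] = u
        for k in range(1, taps + 1):
            g[k] = (1 - mu) * prev[k] + mu * prev[k - 1]
        history.append(g[1:])    # the taps form the memory state
    return history

signal = [1.0, 0.0, 0.0, 0.0]    # an impulse
print(gamma_memory(signal, taps=2, mu=1.0))   # behaves as a delay line
print(gamma_memory(signal, taps=2, mu=0.5))   # leaky, spread-out trace
```

The impulse response makes the salient characteristic of each type visible: the delay line keeps perfect copies for exactly `taps` steps, while the gamma memory smears the impulse over a longer, fading window.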
LSTM Vs GRU in Recurrent Neural Network: A Comparative Study
A recurrent neural network (RNN) is a type of artificial neural network used for predictive tasks on sequential or time-series data. These deep learning layers are commonly used for ordinal or temporal problems such as natural language processing, neural machine translation, and automated image captioning. Modern voice assistants such as Google Assistant, Alexa, and Siri incorporate these layers to provide a hassle-free user experience. The main difference between an RNN and a CNN is that an RNN has memory, allowing information from prior inputs to influence the current input and output. Otherwise, the training methods for these networks are the same.
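One concrete difference between the two cell types is parameter count: an LSTM cell has four gates, a GRU three, and each gate holds input weights, recurrent weights, and (in common implementations such as PyTorch's) two bias vectors. A back-of-the-envelope comparison:

```python
# Parameter counts per cell, assuming the common layout with separate
# input-side and hidden-side bias vectors per gate.

def gate_params(input_size, hidden_size):
    # input weights + recurrent weights + two bias vectors
    return (input_size + hidden_size) * hidden_size + 2 * hidden_size

def lstm_params(input_size, hidden_size):
    return 4 * gate_params(input_size, hidden_size)   # i, f, g, o gates

def gru_params(input_size, hidden_size):
    return 3 * gate_params(input_size, hidden_size)   # r, z, n gates

print(lstm_params(100, 128), gru_params(100, 128))
```

The fixed 4:3 ratio is one reason GRUs often train somewhat faster than LSTMs at the same hidden size, a point comparative studies like this one typically weigh against accuracy on the task at hand.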
LSTM Architecture
"Machine intelligence is the last invention that humanity will ever need to make" -- Nick Bostrom. As we have already discussed RNNs in my previous post, it's time we explore LSTMs for long memories. Since LSTMs take previous knowledge into consideration, it would be good for you to also have a look at my previous article on RNNs (relatable, right?). Let's take an example: suppose I show you an image, and after 2 minutes I ask you about it - you will probably remember its content. But if I ask about the same image some days later, the information might have faded or been totally lost, right? The first case is where we need RNNs (for shorter memories), while the other is where we need LSTMs for long memory capacities. For more clarity, let's take another example: suppose you are watching a movie without knowing its name (e.g. Justice League). In one frame you see Ben Affleck and think this might be a Batman movie; in another frame you see Gal Gadot and think this could be Wonder Woman, right?
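The long-memory trick lives in the gating of a single LSTM step, which is short enough to write out by hand. A minimal sketch with random weights (the `[h, x]` concatenation layout is one common convention; shapes and values here are illustrative):

```python
import numpy as np

# One LSTM cell step: the forget gate f decides how much old cell state
# to keep, the input gate i how much new candidate g to write, and the
# output gate o how much of the cell state to expose as the hidden state.

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, b, hidden):
    z = W @ np.concatenate([h, x]) + b           # all four gates at once
    i = sigmoid(z[0:hidden])                     # input gate
    f = sigmoid(z[hidden:2 * hidden])            # forget gate (long memory)
    o = sigmoid(z[2 * hidden:3 * hidden])        # output gate
    g = np.tanh(z[3 * hidden:4 * hidden])        # candidate values
    c_new = f * c + i * g                        # cell state update
    h_new = o * np.tanh(c_new)                   # hidden state output
    return h_new, c_new

rng = np.random.default_rng(0)
hidden, inputs = 4, 3
W = rng.standard_normal((4 * hidden, hidden + inputs))
b = np.zeros(4 * hidden)
h = np.zeros(hidden)
c = np.zeros(hidden)
h, c = lstm_step(rng.standard_normal(inputs), h, c, W, b, hidden)
print(h.shape, c.shape)
```

When the forget gate sits near 1 and the input gate near 0, `c_new ≈ c`, so the cell carries information unchanged across many steps - that is the "days later" memory from the image example.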
Creating Human Memory Structure in RNN
Machine learning and Artificial Intelligence developments are happening at breakneck speed! At such a pace, you need to understand the developments at multiple levels - you obviously need to understand the underlying tools and techniques, but you also need to develop an intuitive understanding of what is happening. By the end of this article, you will develop an intuitive understanding of RNNs, especially LSTM & GRU. Have a look at this article on NLP. I took a handful of tweets and used the word count of positive versus negative words to classify the sentiment of each tweet.
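The word-count classifier just described fits in a few lines; the lexicons and example tweets below are invented stand-ins for the ones in the referenced article.

```python
# Minimal bag-of-words sentiment classifier: count positive vs negative
# lexicon hits and pick the larger side. Lexicons are illustrative.

POSITIVE = {"great", "love", "happy", "excellent"}
NEGATIVE = {"bad", "hate", "sad", "terrible"}

def sentiment(tweet):
    tokens = tweet.lower().split()
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    if pos > neg:
        return "positive"
    if neg > pos:
        return "negative"
    return "neutral"

print(sentiment("I love this great phone"))   # positive
print(sentiment("terrible battery, sad"))     # negative
```

Counting ignores word order entirely ("not great" and "great" score the same), and that blindness to sequence is exactly the gap that RNNs, LSTMs, and GRUs are built to fill.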
Getting A Machine To Do My English Homework For Me
I've never liked high school English class. Maybe it's the fact that assignments are always super subjective. Maybe it's because the books that we're forced to read are long and boring. Maybe it's because Shakespeare is literally written in another language. What ends up happening because I don't really like English class is that I stop paying attention to what my teacher is saying and I don't read the books we're supposed to read.